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IDENTIFYING INDUSTRY SECTORS USING 
STATISTICAL CLUSTERIZATION 

5 BACKGROUND OF THE INVENTION 

Field of the Invention . 

The present invention relates to techniques for identifying industry sectors and 
classifying particular companies into those sectors. 

10 

Description of the Related Art . 

For some time now, it has been common to group companies and their 
corresponding stocks into various business sectors. In theory, after having done so, 
the performance of any company can be compared against the performance of other 

15 companies in the same sector. These types of comparisons can have important 
implications in portfolio management and financial planning. 

In addition, the sectoral statistics themselves can provide significant 
information regarding the macroeconomy and the prospects for other related and/or 
dependent industries. For example, a significant decline in profits in an agricultural 

20 sector may have a strong correlation with future sales of farm equipment. 

Conventionally, each sector has been defined to include a particular type of 
business. Thus, in theory, each company can be assigned to the sector 
corresponding to its line of business. Unfortunately, many companies are diversified 
and therefore cross conventional sector boundaries. In addition, often times, 

25 companies that on the surface appear to be engaged in similar business are in fact 
affected by significantly different market forces. Still further, even the sector 
definitions themselves frequently fail to keep track of changing technology and 
changing business models, meaning for example that a high-growth evolving 
technology may be grouped with an older well-established technology. 

30 In short, the conventional techniques typically have serious problems both 

with sector definitions and with assigning companies to particular sectors once those 
sectors have been defined. Each of these problems with conventional sector-based 
approaches can have the effect of significantly skewing the resulting sectoral 
statistics, substantially reducing the value of any comparison to such statistics. 

1 
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SUMMARY OF THE INVENTION 

The present invention addresses this need by grouping stocks based on 
similarities of their elasticities to, sensitivities to, or similar measures of their 

5 tendencies to change in value as a result of changes in the values of a number of 
different exogenous variables. 

Thus, the invention is directed to classifying stocks into business sectors by 
calculating, for each of multiple exogenous variables, a measure of a tendency for 
a value of a stock to change as a result of a change in a data value for each such 

10 exogenous variable. The foregoing step is then repeated for each of several 
different stocks. Finally, the different stocks are grouped into different sectors based 
on similarities of such measures of tendency to change. 

By grouping stocks in this manner, the present invention often can provide for 
automatic and dynamic definitions of industry sectors and simultaneous classification 

15 of different stocks into those sectors. As a result, many of the problems of 
conventional sectoral analysis techniques typically can be avoided. Moreover, 
because sectorization according to the present invention is based on elasticities, 
sensitivities or similar measures, the identified sectors often will more appropriately 
group similarly situated companies. 

20 The foregoing summary is intended merely to provide a brief description of the 

general nature of the invention. A more complete understanding of the invention can 
be obtained by referring to the claims and the following detailed description of the 
preferred embodiments in connection with the accompanying figures. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a flow diagram for explaining evaluation and screening of assets 
according to a representative embodiment of the invention. 

Figure 2 is a flow diagram for explaining portfolio evaluation and screening 
according to a representative embodiment of the invention. 
30 Figure 3 illustrates display of asset elasticity information according to a 

representative embodiment of the present invention. 
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Figure 4 is a block diagram of a general-purpose computer system, 
representing one suitable computer platform for implementing the methods of the 
present invention. 

5 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The following disclosure pertains to multiple inventions that are claimed in 
separate patent applications. The commonly assigned patent applications filed of 
even date herewith and titled, "Sensitivity/Elasticity-Based Asset Evaluation and 
Screening" and "Significance-Based Display" are incorporated herein by reference 
10 as though set forth herein in full. 

Asset Evaluation and Screening . 

The present invention provides asset evaluation and screening techniques 
that may be incorporated into an asset evaluation/screening tool for use in portfolio 

15 management and financial planning. In the preferred embodiments, the techniques 
of the present invention create a model for predicting the value of an asset (such as 
a stock) based on various exogenous variables. The model is generated by using 
historical data for the value of the asset and for the exogenous variables. Similar 
models are then created for a pool of other assets. Such models can then be used 

20 to perform "what if analysis, allowing a user to input various scenarios and then 
obtain information as to how various characteristics of a specified asset will change. 
In addition, the techniques of the present invention can permit asset screening based 
on such characteristics. 

Figure 1 illustrates a flow diagram for explaining asset evaluation and 

25 screening according to a representative embodiment of the present invention. 
Briefly, according to Figure 1 , historical data are input for an asset; a price formula 
is determined for the asset based on the input historical data; the foregoing steps are 
repeated for different assets; scenario values are then input for certain exogenous 
variables; value(s) are calculated for selected asset(s) based on the input scenario; 

30 a tendency of the asset value to change is calculated, based on the input scenario; 
any desired screening is performed; the calculation of tendency of asset value(s) to 
change and screening steps may then be repeated for alternate exogenous variable 
scenarios; based on results of the foregoing evaluation and screening, asset 
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holdings are adjusted; in addition, the models periodically are updated to incorporate 
new historical data. 

In more detail, in step 22 historical data are input for the first asset. Such 
information preferably includes measurements of the value of the asset and other 

5 data that relate to general macroeconomic conditions, and also may include other 
information that is more specific to the asset. In the preferred embodiment of the 
invention, a list of such variables is specified and the data value for each variable is 
input at each of plural specified points in time over an extended time interval. For 
example, values for all variables at a predetermined time each day (e.g., the close 

10 of business, Pacific Time) may be input for each business day in a previous time 
period T, where T may be anytime period but preferably is at least 30 days in length, 
in order to obtain a statistically meaningful sample. For example, T may be 180 
days, 1 year, 2 years, 3 years, 4 years or even longer. Currently, it is preferable to 
use a time period T of the immediately preceding 3 years. 

15 Examples of the types of general macroeconomic data that may be included 

are any or all of the following: Federal Funds Rate Daily; 1 -year Treasury Bill Rate; 
10-year Treasury Constant Maturity Rate Daily; 30-year Treasury Constant Maturity 
Rate Daily; Moody's Seasoned Baa Corporation Bond Yield Daily; Consumer Debt 
Service Payments as Percent of Disposable Personal Income; Corporate Net Cash 

20 Flow; Net Foreign Investment; Total Consumer Credit Outstanding, Not Seasonally 
Adjusted; Trade Weighted Exchange Index: Major Currencies; Total Business 
Inventories: Manufacturers, Retailers & Merchant; Inventory/Sales Ratio: Total 
Business; Manufacturers New Orders: Non-defense Excluding Aircraft & Parts; Retail 
Sales, Not Seasonally Adjusted; New Privately Owned Housing Units Started: 

25 Structures with One Unit; Total Industrial Production Index; Oil Price Domestic: West 
Texas Intermediate; Real Gross Domestic Product in Chained 1996 Dollars; Real 
Gross Private Domestic Investment; National Defense Consumption Expenditures 
& Gross Investment; Real Nonresidential Investment: Equipment & Software; Real 
Net Exports; Consumer Price Index (CPI) for All Urban Customers; CPI - Energy; PPI 

30 - All Commodities; Money Stock; Adjusted Monetary Base; St. Louis Adjusted 
Monetary Base; NAPM: Composite Index; Composite Index of Leading Indicators, 
1992=100; University of Michigan: Consumer Sentiment; University of Michigan: 
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Inflation Expectation; Compensation of Employees; Unemployment Rate; and/or 
Median Duration of Unemployment. 

Examples of the types of information specific to the asset that may be input 
in step 22, in the case that the asset is a share of stock, or is related to the value, 

5 return or some other characteristic of a share of stock, include: (i) any of a variety of 
trading information regarding the stock, such as the stock price, stock trading 
volume, volatility of the stock price, trading price of options on the stock, information 
pertaining to analyst recommendations, and/or any of the foregoing information 
normalized with respect to data for either similar stocks (e.g., stocks in the same 

10 sector) or the market as a whole; and/or (ii) any of a variety of information pertaining 
to the company that issues the stock, such as industry classification, number of 
employees, any or all of the company's financial information (e.g., book value, 
amount of debt, debt/equity ratio, amount of profits amount of revenues or types of 
assets), usage rates of particular raw materials, employee and/or management 

15 turnover rates, and/or information pertaining the company's amount and/or type of 
diversification. 

The foregoing macroeconomic, asset-specific (other than the measurement 
of value of the asset) and/or sector-specific variables are referred to herein as the 
exogenous variables. In addition, the exogenous variables may include not only 
20 financial and economic data (such as those listed above), but any other type of data 
as well. For example, it is possible to include exogenous variables whose data 
values pertain to population, climate, popular tastes or sentiments, political 
environment, current mass media content, and/or any other social, environmental or 
physical conditions. 

25 Moreover, in addition to inputting actual historic data values, the exogenous 

variables may include forecasts of any economic or financial data (such as forecasts 
of any of the above-mentioned data) or even forecasting errors. With respect to 
forecasts and/orforecast errors, the data value for any forecast orforecast error may 
be deemed "current" (for purposes of data input) either at the time the forecast is 

30 made, as of the date/time with respect to which the forecast is made, or at any other 
arbitrarily selected time. 

In general, it will be desirable that for each time point at which data are 
entered, current data values for all exogenous variables used should be input. To 

5 
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the extent that current information for any such exogenous variable is not available 
at any such given point in time, it generally will be preferable to use the most recent 
data for such exogenous variable. For example, certain economic data may be 
announced only monthly or only quarterly. If this is the case, the most recently 

5 announced data value is used until the next announcement. Particularly in cases 
where the announcement of any data value is significantly less frequent than the 
desired frequency of data input (but also in any other cases as well), it may be 
preferable to include an additional exogenous variable that specifies how current the 
data value for one of the other exogenous variables is. 

10 In this regard, certain announced data may be indicated as being valid for only 

a specific previous historical period. For instance, a certain measurement of the 
unemployment rate for July may not be announced until late August. In such a case, 
it is preferable to use the announced unemployment rate for all data input times that 
fall in July. To the extent such unemployment rate information is required for August 

15 but is not yet announced, it is preferable to use the announced July rate for all data 
input times that fall in August (with or without a seasonal adjustment factor), together 
with an additional exogenous variable that indicates the duration of time since the 
effective date of the last announcement (e.g., a variable indicating the date in 
August). 

20 When data are subsequently input for other assets (as described below) much 

of the data previously input for the exogenous variables may be reused. However, 
to the extent that different data input times are used for different assets, it might be 
necessary to input new data values. To the extent that the data input times are 
identical (or are at least close enough in time that new announcements have not yet 

25 been made), the following is a list of the types of data that typically may be reused 
for different assets: the general economic data; the data that are not related to 
economic or financial factors; any industry-specific data, provided that two securities 
are issued by companies in the same industry sector (however defined); and any 
other data that is not unique to one asset relative to the other. 

30 In the preferred embodiment of the invention, the exogenous variables include 

only financial, economic and/or other types of data that are not particularly 
associated with any individual asset, any single class of assets or any single industry 
but instead affect various assets in various classes and industries. More preferably, 

6 
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the exogenous variables include the specific general economic information listed 
above. As a result, provided that the data input time points are the same for different 
assets, once a set of data has been input for the first time, only an indicator of asset 
value will have to be input for each additional asset. 

5 In step 24, a price formula is determined for the current asset based on the 

data input for the asset in step 22. The determined price formula relates the 
indicator of the value of the asset that was input in step 22 to the exogenous 
variables input in step 22. If Y represents the value of the asset and X represents 
a vector containing the data values for the exogenous variables that are to be used 

10 in estimating Y, then the relationship between Y and X can be expressed as: 

Y = f(X) + U (Eq. 1) 

where U is independent of X and, by including an appropriate constant term in f(X), 
15 can be assumed to be zero-mean. As a result, the expected value of Y equals the 
expected value of f(X), which can be stated algebraically as follows: 

E{Y} = E{f(X)} (Eq. 2) 

20 Eq. 1 can be expanded, for example, using a Maclaurin series expansion. In 

the simplest case of such an expansion, in which X consists of a single variable, Y 
can be expressed as follows: 

Y = (30 + pi*X + p2*X 2 + p3*X 3 + ... + U (Eq. 3) 

25 

where U is the approximation error and is independent of X. In order to obtain a 
practical representation of Y, the infinite series represented by Eq. 3 is truncated. 
Although Eq. 3 can be truncated at any point, it is presently preferable to truncate 
Eq. 3 by eliminating all powers of X greater than 2 or 3. As a result, U generally can 
30 be assumed to be uncorrelated with X. Eq. 3 above also can easily be modified to 
express the more general case of a Taylor series expansion. 



7 

0191802.1 



35512-00034 



When X consists of multiple variables X;, the Maclaurin series expansion will 
include the higher order terms of the various X; as well as cross-product terms, such 
as X, Xj. For example, the second order Maclaurin series expansion is given as: 

N N N 

5 Y=a+ y £b,X,+ mic g X,Xj (Eq - 4) 

i=\ i=l 7 = 1 



where N is the number of exogenous variables X ; . In Eq. 4, each b f is the first order 
partial derivative of Y with respect to X, evaluated at the origin, and the eg are the 

mixed partial derivatives of Y with respect to X, and X, evaluated at the origin. Of 
10 course, higher order Maclaurin series expansions are also possible (e.g., third order). 

In addition, Eq. 4 also can easily be modified to express the more general case of 

a Taylor series expansion. 

In order to determine the price formula for predicting the value of an asset, it 

is necessary to determine the coefficient values (e.g., in the case of a second order 
15 expansion, values for a and for all b, and c ;j ) in the above-described Maclaurin or 

Taylor series expansion. Such values can be determined in any of a variety of ways. 

In one embodiment, the coefficients are calculated using a statistical regression 

technique, such as by minimizing the total of some function of the error (e.g., 

magnitude of error or squared error) between each actual data point and the point 
20 predicted by the resulting formula. Such techniques are well known in the art and 

therefore are not discussed in detail here. 

Although the preferred embodiment of the invention uses a Taylor series 

expansion representation for Y, any other predefined parametric equation may used 

instead, such as a Fourier series expansion or similar frequency-space 
25 transformation. In any event, the parameters for any such predefined parametric 

equation generally can be determined in a similar manner to that described above, 

e.g., minimizing the total of some function of error. 

As a still further alternative, it is possible to determine a price formula where 

the format of the equation is not predefined, but rather determined dynamically 
30 based on the input data. The preferred method for implementing such a solution is 

to use a neural network technique. As is well known in the art, neural networks 
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typically operate by propagating data throughout a network of nodes, with a weight 
applied to each data element as it propagates from one given node to another given 
node. 

The neural network is trained to produce the correct response by inputting 
5 sample input-output pairs, observing the system's actual output in response to the 
sample input, and comparing such actual outputs to the provided sample outputs. 
The training algorithm then adjusts the weights between the nodes, and may even 
create and/or delete nodes, based on the results of the comparison. Any of 
numerous different training algorithms may be used, such as a genetic algorithm. 
10 Also, by restricting creation and deletion of nodes, a neural network may also be 
used to identify coefficients in the above-described simpler case of a predefined 
parametric equation. 

In the present case, such a neural network is trained using the data input in 
step 22 to provide the appropriate price (or other measurement of value) for the 
15 asset as a function of the data values for the exogenous variables. Once training 
has been completed using such data, the structure and weighting coefficients of the 
neural network are fixed and define a formula that provides an estimate for the value 
of the asset in response to an input of any data values for the set of exogenous 
variables. 

20 In the examples given above, the price formula expresses the actual value of 

the price (or other measure of value) of the subject asset as a function of the actual 
data values for the exogenous variables. Such a formulation lends itself will to 
determining the sensitivity of Y to each variable making up X (i.e., X,) because in this 
case the partial derivative of Y with respect to X, gives the sensitivity of the value of 

25 the asset to the exogenous variable corresponding to X,. However, in the preferred 
embodiment of the invention, the price formula is expresses a logarithm of the value 
of an asset as a function of logarithms of the exogenous variables. In this alternative 
formulation, the partial derivative of Y with respect to X; gives the elasticity of the 
value of the asset to the exogenous variable corresponding to Xj. 

30 Also, in calculating the price formula as described above, it is possible to treat 

all data points equally. Alternatively, it may instead be preferable to weight more 
recent observations more heavily than those observations that are more remote in 
time. In addition, as both the measure of the subject asset's value and the data 

9 

0191802.1 



35512-00034 



values for the exogenous that are used typically will only be estimates of the actual 
values and data values, respectively, in certain cases it may be preferable to more 
heavily weight those observations that are known with more certainty (e.g., lower 
variance). 

5 In step 26, a determination is made as to whether price formulas have been 

calculated (in step 24) for all of the assets of interest. It is noted that it may be 
desirable to perform step 24 for all assets for which data have been input (i.e., all 
assets in the tool's database) or for only the subset of such assets that are of interest 
to the current user. If the determination in step 26 is affirmative, processing 
10 proceeds to step 30. If not, processing proceeds to step 28 to input historical data 
(preferably reusing previously input data to the extent possible, as discussed above) 
for the next asset and to calculate a price formula for that asset in step 24. 

In step 30, values for the exogenous variables, collectively comprising a 
particular scenario, are input. Typically, a user will manually input such a scenario. 
15 However, some or all of the data values comprising the scenario may be generated 
automatically, such as may be provided by a separate forecasting system, e.g., using 
any of the techniques described in commonly assigned U.S. patent application serial 
numbers 09/392,361 , 09/391 ,765, 09/392,109, 09/391 ,962, 09/391 ,534, 09/392,106, 
or 09/391 ,764, filed September 8, 1999, or 09/494,200, filed January 28, 2000, all 
20 of which are incorporated herein by reference as though set forth herein in full. 

There are many different techniques for inputting a scenario. For example, 
in one embodiment of the invention, a data value for each variable is separately 
input. In an alternative embodiment, default values have already been entered for 
the exogenous variables, and therefore it is only necessary to replace those default 
25 values as desired. Preferably, the default data value for each exogenous variable 
(i.e., the data value to be used if no other data value is provided for such variable) 
is the most currently available data value for the variable. In a still further 
embodiment, only changes in the default values are required to be input, with the 
default change value being zero. It is noted that such changes may be input as 
30 either the actual expected difference from the default value or as the expected 
percentage change in the default value. Still further, it is possible to give the user the 
option as to which input method to use. Regardless of how the scenario is initially 
input, the tool according to the present invention preferably converts such inputs into 

10 
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a complete set of data values for the exogenous variables for use in the processing 
described below. 

Due to the interrelationships among the exogenous variables, when a change 
in a data value for one of the exogenous variables is input, it may be preferable in 

5 certain embodiments of the invention to automatically account for the changes 
expected in the other exogenous variables as a result of such input change. More 
details regarding such a feature and the tradeoffs pertaining to incorporating such 
a feature are described in connection with the discussion of step 34 below. 

In step 32, the value of each asset under consideration is determined, 

10 preferably by plugging the data values for the exogenous variables input in step 30 
into the price formula for such asset calculated in step 24. The price formula 
typically can be calculated in a straightforward manner by replacing the exogenous 
variables with the corresponding scenario data values and then calculating the result 
of the formula. In an embodiment in which a neural network (or similar network- 

15 based solution) is used, the data values forthe exogenous variables typically can be 
provided as the inputs to the network, with the network output being the asset value 
estimate. 

In step 34, the tendency(ies) of one or more of the asset value(s) to change 
as a result of change(s) in one or more of the exogenous variables are calculated, 
20 based on the input scenario. Preferably, such tendencies will be sensitivities and/or 
elasticities of the value of the asset to one or more of the exogenous variables. 
However, any other measures of tendency to change may instead (or also) be 
calculated. 

Ordinarily, in the case where a predefined parametric equation is used, these 
25 calculations will mainly involve taking or estimating partial derivatives of the price 
function with respect to the exogenous variables of interest. In particular, when the 
price formula is determined from a pre-defined parametric equation, a closed-form 
solution for each such partial derivative often can be determined in advance and then 
stored. For instance, assume that Y is given as a second order polynomial function 
30 of X, that X consists of only two variables, X, and X 2 , and that Y represents the 
actual value of the asset and the X, represent the actual data values for the 
exogenous variables. In this case, the sensitivity of Y to X, is found by taking the 
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partial derivative of Y (as expressed in Eq. 4 above) with respect to X, and therefore 
is given as: 

4y^ = h + 6 2 0& 2 /<*i)+ c l2 x 2 + c, 2 JC, 0*2/^1) + 2c„x, + 2c 22 x 2 (^ 2 M) (Eq. 5) 

5 

In Eq. 5, the b and c coefficients were determined in step 24 and the values 
for X, and X 2 were input in step 30. The only remaining value to be supplied is the 
partial derivative of X 2 with respect to X,. This value can be assumed to be zero if 
X, and X 2 are known to be largely independent of each other or, subject to the 

10 considerations described below, may be arbitrarily assumed to be zero. Otherwise, 
the relationship between X, and X 2 can be determined by performing a linear or non- 
linear regression technique using historical data for the two corresponding 
exogenous variables, by performing a neural network technique using such data to 
train the network, or in any other manner. Regardless of which technique is used, 

15 it is preferable also to evaluate the statistical significance of the correlation between 
X { and X 2 and then to assume that dX 2 /dX, is zero if such statistical significance is 
less than a specified (e.g., predetermined) threshold. 

In general, in order to obtain a closed-form solution of each partial derivative 
of any price formula that is expressed as a polynomial expansion, it typically will be 

20 necessary to either: (i) evaluate the partial derivative of each exogenous variable with 
respect to each other exogenous variable and also to evaluate the statistical 
significance of each such partial derivative; or (ii) assume that such partial 
derivatives are equal to zero. 

In certain cases, a closed-form solution of the partial derivatives cannot easily 

25 be obtained. For example, this situation is likely to occur when a neural network is 
utilized or even in certain cases involving more complicated pre-defined parametric 
equations. In such cases, it is possible to obtain an estimate of the instantaneous 
derivative directly from the price formula obtained in step 24. In one example, such 
an estimate is obtained by observing the value of the asset calculated in step 24 

30 using the scenario input in step 22, slightly changing the data value of one of the 
exogenous variables (e.g., by 1% of its previous value), and then calculating the 
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change in the asset's value divided by the change in the data value for the 
exogenous variable. 

On the other hand, recognizing that a change in one exogenous variable 
might correlate with changes in one or more other exogenous variables, an 

5 alternative embodiment of the present invention estimates the instantaneous 
derivative of the price formula in such cases by taking into account any changes that 
are likely to occur in the other exogenous variables as a result of the small change 
in the one exogenous variable. To accomplish this, in one embodiment of the 
present invention the partial derivative (e.g., sensitivity) of each exogenous variable 

10 with respect to each other exogenous variable and the statistical significance of each 
such partial derivative are calculated, such as described above. Then, the effect of 
a slight change in the data value of one of the exogenous on the data values of the 
other exogenous variables can be readily calculated. Accordingly, the calculated 
changes in such other exogenous variables are applied, as well as the change in the 

15 data value for the subject exogenous variable, and the resulting new set of data 
values is input into the network (or plugged into the price formula) to calculate a new 
value for the asset. By dividing the change in the asset value by the change in the 
data value for the subject exogenous variable, it may be possible to obtain a more 
complete measure of the tendency of the asset value to change as a result of a 

20 change in a particular one of the exogenous variables. 

As indicated above, two distinct approaches exist for determining the 
tendency of an asset value to change as the result of a change in the data value for 
an exogenous variable. In the first approach, the sensitivity of each exogenous 
variable to each other exogenous variable is ignored (i.e., the exogenous variables 

25 are treated as being independent). In the second approach, the sensitivities of the 
exogenous variables to each other are taken fully into account in determining the 
tendency of the asset value to change as a result of a change in one of the 
exogenous variables. The particular approach selected typically will depend upon 
the needs of the user. 

30 With the first approach, the user generally will be required to account for 

correlations between the exogenous variables in some other way, such as in 
connection with subsequent processing of the various price sensitivities, elasticities 
or other measurements of tendency to change. On the other hand, with the second 
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approach, in subsequent processing the user typically must recognize that certain 
changes in the exogenous variables have been automatically anticipated; therefore, 
such subsequent processing: (i) generally must attempt, for each exogenous 
variable, to estimate only that portion of the change that would not have been 

5 predicted by previously entered changes in the other exogenous variables; and (ii) 
may be required, in certain circumstances, to back out redundantly reflected 
relationships among the exogenous variables. 

It should be noted that similar considerations and tradeoffs in determining 
whether to reflect expected changes in related exogenous variables may also be 

10 made in connection with the inputting the various projected scenarios in step 30 
(discussed above). Also, it is possible to make either option available to the user 
(which options may be made available independently for steps 30 and 34) and let the 
user select the appropriate option to use for each application (e.g., by selecting a 
corresponding configuration setting). 

15 As noted above, in the event that the price formula obtained in step 24 relates 

actual value of the asset to actual data values for the exogenous variables (i.e., Y 
represents the value of the asset and X { represents the data value of the 
corresponding exogenous variable), then simply estimating a partial derivative of Y 
with respect to X; will provide the sensitivity of the asset value to such exogenous 

20 variable. Obtaining the elasticity of the asset value to such exogenous variable in 
this case will require calculating (X i A^)*(dY/dX i ). On the other hand, if the price 
formula obtained in step 24 relates the logarithm of the asset value to the logarithms 
of the data values for the exogenous variables (i.e., Y represents a logarithm of the 
value of the asset and Xj represents a logarithm of the data value of the 

25 corresponding exogenous variable), then simply estimating a partial derivative of Y 
with respect to X^ will provide the elasticity of the asset value to such exogenous 
variable. 

In step 36, any desired asset screening is performed. According to the 
present invention, such screening can be based on the scenario-based estimates of 
30 asset values and/or tendencies of the asset values to change in response to 
changes in the exogenous variables (e.g., sensitivities or elasticities), calculated 
above, instead of or in addition to the factors conventionally used for screening 
stocks and other assets. 
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For example, a user may in step 30 input a scenario in which the 
unemployment rate, inflation rate and the price of crude oil change in specific 
amounts, but all other exogenous variables remain at their default values (or, 
depending upon the system configuration and possibly the option settings selected 

5 by the user, change in the amounts expected based on the changes specified for 
those three exogenous variables). After steps 32 and 34 have completed for all 
assets desired to be searched, the user may then in step 36 search for all stocks that 
have increased in value by at least a specified percentage and that have price 
elasticities to the Japanese exchange rate and to Gross National Product that are 

10 within a specified range. The user may further limit the search to only those stocks 
issued by companies that have fewer than 500 employees. In fact, assets can be 
screened in this manner based on any combination of projected asset value under 
the specified scenario, sensitivity or elasticity to any exogenous variable(s) given the 
specified scenario, and/or any other information that has been input or derived for 

15 assets in the database (e.g., any of the information conventionally used for asset 
screening). 

It is noted that it is not necessary to calculate a value for each asset in step 
32 and a tendency of asset value to change for each asset with respect to each 
exogenous variable in step 34. Rather, steps 32 and 34 may instead be performed 

20 only to the extent needed in connection with a user's analysis of particular assets or 
in connection with screening over an identified group of assets. For example, it may 
be more efficient in the example given above to first identify those companies that 
have fewer than 500 employees in the database and then calculate the asset values 
in step 32 only for those companies and calculate tendencies of the asset values to 

25 change in step 34 only for those companies and only with respect to the Japanese 
exchange rate and to the Gross National Product. 

In step 37, the user's holdings are adjusted based on the results of the 
analysis in steps 32 and/or 34 and/or based on the screening in step 36. For 
example, after determining a projected value and projected elasticities for an 

30 individual stock based on an input scenario, a user may decide to sell some or all of 
the stock, short sell the stock, purchase the stock, purchase or sell an option on the 
stock, purchase or sell another derivative instrument whose value is based on a 
characteristic of the stock, and/or initiate any other purchase, sale or other economic 
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transaction to meet the user's financial goals. Such decisions may be: (i) made 
solely by the user based on the above-described information provided by the 
evaluation/screening tool of the present invention; (ii) recommended to the user by 
the evaluation/screening tool by including within the tool capabilities for a user to 

5 input the user's financial goals and process steps for automatically screening stocks 
and/or other financial assets to attain those financial goals (which recommendations 
may be provided by the evaluation/screening tool to the user with or without the 
underlying data on which such recommendations were made); and/or (iii) performed 
automatically by the evaluation/screening tool without user input after evaluating the 

10 user's financial goals and performing any indicated screening (e.g., according to 
predetermined process steps). 

Options (ii) and (iii) above require the evaluation/screening tool of the present 
invention to include additional analytical functionality, typically directed toward 
making the tool more user-friendly. However, such functionality generally is relatively 

15 straightforward to implement. For instance, assume that a user has indicated that 
he wants to maximize growth within a specified time horizon, subject to the condition 
that risk should be limited with respect to certain specified exogenous variables. In 
this case, the tool preferably would search the stocks in the database and sort such 
stocks into groups having negative, positive and approximately zero elasticities to 

20 each of the exogenous variables; calculate the expected returns to each such stock; 
and then construct a portfolio, possibly using an iterative technique, that balances 
the elasticities to within the specified limits while achieving the maximum possible 
return. Depending upon whether option (ii) or option (iii) is being implemented, the 
resulting trades required to achieve that portfolio are then either recommended to the 

25 user or initiated automatically by the tool. The process steps which determine which 
trades to make may also be supplemented to account for tax implications and/or 
trading costs. 

Although step 37 is shown in Figure 1 and discussed above as being 
performed after step 36, it should be understood that step 37 may also or instead be 
30 performed at various points in the process, such as immediately after step 32 or 
immediately after step 34. 

In step 38, a determination is made as to whether any additional scenarios 
need to be tested. In the preferred embodiment of the invention, the user simply 
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indicates whether or not he or she would like to input another scenario. However, 
if the tool according to the present invention is incorporated into a more extensive 
financial or economic analysis system, another program or system might provide this 
indication. If analysis of an additional scenario is desired, then processing returns 

5 to step 30 to input data values for the exogenous variables. If not, then processing 
proceeds to step 40. 

In step 40, a determination is made as to whether the model needs to be 
updated. In the preferred embodiment of the invention, the price formulas are 
recalculated periodically using historical data over a rolling period of time. For 

10 example, the price formula may be generated using data over the past three years 
and recalculated each business day. When it is time to update the model, 
processing returns to step 22 to input historical data for the first asset, together with 
historical data values for the contemporaneous exogenous variables. When step 22 
is being repeated, as contrasted with the first time the entire process is executed, it 

15 generally is not necessary to input the entire data set. Rather, only the new data 
generally need to be added and the old data (outside the rolling period) deleted from 
the data set upon which the price formula is calculated. In addition, weights may be 
reassigned to reflect the relative recency of each data entry in the thus newly formed 
data set. 

20 Figure 2 illustrates a flow diagram for evaluating/screening portfolios 

according to a representative embodiment of the present invention. In general, much 
of the processing for evaluating/screening portfolios will be similar to that used for 
analyzing/screening individual assets, but with certain additional functionality. In fact, 
in the embodiment described below, both types of functionality are provided. 

25 In step 122, historical data for a first asset are input. This step is essentially 

identical to step 22 described above. 

In step 124, a price formula is determined for the current asset. This step is 
essentially identical to step 24 described above. 

In step 126, a determination is made as to whether a price formula has been 

30 calculated for the last asset to be processed. As with step 26, discussed above, step 
124, may be performed for all assets for which data have been input or only a subset 
of such assets that are of interest to the current user. If the determination is 
affirmative, processing proceeds to step 130. If not, processing proceeds to step 

17 

0191802.1 



35512-00034 



128 to input historical data (preferably reusing previously input data to the extent 
possible, as discussed above) for the next asset and to calculate a price formula for 
that asset in step 124. 

Steps 128 and 130 are essentially identical to steps 28 and 30, respectively, 

5 as such steps are described above. 

In step 131, composition information is input for one or more portfolios of 
interest. Preferably, the input portfolio composition information includes the type and 
quantity of each asset (e.g., type of stock and number of shares). Such information 
may be input directly by a human user via a user interface (e.g., a graphical user 

10 interface) or may be input by another computer program or system operating in 
conjunction with the asset evaluation/screening tool of the present invention. 

In step 132, asset values are calculated. With respect to individual assets, 
such as individual stocks or individual commodities, this step is essentially identical 
to step 32 described above. However, in addition to allowing the user to obtain the 

15 value of individual assets, in this embodiment of the invention step 1 32 also allows 
the user to obtain the value of the portfolios defined in step 131 under the scenario 
input in step 130. Such a portfolio value preferably is obtained by summing the 
values of the assets included within the subject portfolio. 

I n step 1 34, the tendencies of asset values to change in response to changes 

20 in the exogenous variables are calculated. With respect to individual assets, such 
as individual stocks or individual commodities, this step is essentially identical to step 
34 described above. However, in addition to allowing the user to obtain measures 
of the tendencies of the values of individual assets to change, in this embodiment of 
the invention step 134 also allows the user to obtain similar measures for the 

25 portfolios defined in step 131 under the scenario input in step 130. Such a measure 
for the portfolio preferably is obtained by performing a weighted average of the 
corresponding measures for the assets included within the subject portfolio. 

In step 136, any desired screening is performed. With respect to individual 
assets, such as individual stocks or individual commodities, this step is essentially 

30 identical to step 36 described above. However, this step preferably allows the user 
to search from among different portfolios as well as different individual assets, using 
any of the screening criteria described above for individual assets. 
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In step 137, adjustments in the user's holdings are made based on the 
evaluation/screening data provided by steps 132, 134 and/or 136. This step is 
essentially identical to step 37 described above, but extended to include more 
portfolio-related transactions. Thus, a user may be allowed to supplement or modify 
5 the user's existing portfolio(s) or create one or more additional portfolios. As noted 
above, such actions can be fully automated or can be implemented with varying 
levels of participation from the user. 

In step 138, a determination is made as to whether an additional scenario is 
required to be analyzed. This step is essentially identical to step 38 described 

10 above. However, here the user has the option to alter not only the data values for 
the exogenous variables but also the composition of one or more portfolios. As a 
result, the user is provided with significant flexibility to project how various changes 
in his or her portfolio, as well as changes in external conditions, will affect the 
portfolio's value and/or the portfolio's exposures to various specific risks. 

15 Finally, in step 140 a determination is made as to whether the model needs 

to be updated. This step and the considerations pertaining thereto are essentially 
identical to step 40 described above. If the model does need to be updated, 
processing returns to step 122. 

In the foregoing embodiments of the invention, a price formula is calculated 

20 based on historical data for values for of an asset and historical data values for a 
number of exogenous variables, and then a measure of the tendency of the asset 
value to change as a result of changes in the exogenous variables is calculated from 
that price formula. It should be noted that it is also possible to directly calculate a 
return formula that expresses changes in the value of the asset as a function of 

25 changes in the data values for the exogenous variabfes. For instance, by initially 
inputting data values corresponding to changes in the value of the asset (e.g., either 
actual quantity changes or percentage changes) over some period of time (preferably 
a rolling period of time) and changes in the exogenous variables (e.g., quantity 
changes or percentage changes) over the same period of time, a return formula that 

30 relates such price changes to such changes in the exogenous variables can be 
obtained , using either a linear or non-linear regression or a neural network technique, 
in a similar manner to that described above. 
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It is also possible to calculate the price or return formulas, and corresponding 
measures of tendency of asset value to change based on the exogenous variables, 
separately in different environments. For instance, it is possible to calculate one 
price formula using only data for periods of increasing inflation rates and another 
5 price formula calculated using only data for periods of decreasing inflation rates. In 
this way, for instance, it can be determined whether the elasticities of the asset 
values to inflation rates are symmetric (i.e., the same during periods of rising inflation 
as during periods of declining inflation) or, if not, how they differ. Alternatively, 
similar information could be obtained by including an additional variable that 

10 indicates the change in the rate of inflation. Such an additional variable could be 
binary (i.e., indicating either increasing ordecreasing inflation rates) orcould indicate 
the change in the inflation rate (either in terms of the quantity change in the inflation 
rate or in terms of the percentage change in the rate). 

In addition to determining different environments in the foregoing deterministic 

15 manner, it also may be preferable, in certain circumstances, to dynamically define 
the different environments for which separate models are to be generated. For 
instance, after collecting historical data over the three previous years, such data may 
be subject to statistical cluster analysis (as described in more detail below). The 
resulting clusters may then be interpreted as distinct economic environments, for 

20 which different price or return models may be generated. The subsequent scenario- 
based processing will then use the model corresponding to the environment in which 
the input scenario falls. Utilizing separate scenarios in this fashion often may provide 
more accurate prediction and estimation results, because each model can be 
separately tailored to a unique environment and also because, at least in the case 

25 of a Taylor or Maclaurin series expansion, the dispersion of the historical data points 
around the expansion point often can be significantly reduced. It is noted that, 
generally, each such expansion point will be located at or near the center of the 
corresponding environment. 

In this latter regard, it is noted that the location of the expansion point for a 

30 Taylor or Maclaurin series expansion generally will affect the accuracy of the 
resulting model. In addition to locating the expansion points as indicated above, the 
expansion point may be located at, or otherwise based on, the input scenario. 
Similarly, the expansion point may be located at, or otherwise based on, an 
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independently generated prediction of the future environment, such as using a 
prediction made in accordance with any of the techniques described in co-pending 
applications 09/392,361, 09/391,765, 09/392,109, 09/391,962, 09/391,534, 
09/392,106, or09/391 ,764, filed September8, 1999, or09/494,200, filed January 28, 
5 2000. 

By using a large number of exogenous variables (e.g., at least 30 to 50) the 
price formulas (or return formulas) of the present invention often can approximate 
the reduced form of the actual value of (or return to) the asset, whatever that form 
may happen to be. As a result, it is preferable to use such a large number of 

10 variables in the technique of the present invention. Currently, it is most preferable 
to use approximately 35 exogenous variables. 

The above discussion frequently refers to the "value" of an asset. Generally, 
the value of an asset will be the price at which it is traded. However, other 
measurements of value may be used in addition to or in place of selling price. Such 

15 other measurements may be of particular importance, for example, when the subject 
asset is thinly traded, the subject asset frequently is traded in combination with other 
assets, or there exists any other factor that makes selling price an inappropriate 
indicator of the asset's value. As used herein, an "asset" may refer to a stock, a 
commodity, an index, a mutual fund, a derivative instrument whose value is based 

20 on the value, or on some other characteristic, of any of the foregoing, or any other 
item of value. 

The issue of statistical significance of the estimated measures of tendencies 
of the asset values to change based on changes in the exogenous variables is 
important. There may be numerous instances where there is no statistical 

25 significance to the estimated price formula and, consequently, no statistical 
significance to the estimated measure of tendency to change. 

One approach to this problem is to use a statistical significance threshold 
(e.g., as part of any screening). The statistical significance of each coefficient can 
be tested using Student's t-test. Similarly, the statistical significance of groups of 

30 coefficients can be tested using the f-test. With respect to the latter, a number of 
groups may be defined, each group corresponding to a single exogenous variable 
and including the coefficients associated with all terms that include that exogenous 
variable. It is noted that in this example, if a second or higher order Taylor series 

21 

0191802.1 



35512-00034 



expansion had been used, the existence of cross-product terms will mean that the 
defined groups will overlap. 

Alternatively, the group may include all coefficients used in the price formula. 
Then, any asset for which the identified coefficients have insufficient statistical 

5 significance (e.g., a p value exceeding some threshold, such as 5% significance) 
preferably would be excluded from the candidate pool for screening and generally 
would not be used for most other purposes in which the asset would be considered 
individually. However, in certain cases where aggregate statistics are to be 
calculated across multiple assets, the data for such an asset may be useful. 

10 In the foregoing estimation of statistical significance, the p value associated 

with any given t-test or f-test can be estimated with reference to a specified 
confidence interval for each of the subject coefficients. Alternatively, such 
confidence interval(s) can be specified and then the p value associated with such 
confidence interval(s) can be determined. For instance, it is possible to specify a 

15 confidence interval of ± 5% for each coefficient (i.e., for each coefficient, the interval 
from 95% to 105% of the estimated value for such coefficient) and then determine 
the p value associated with such interval(s) (i.e., the probability that any of such 
coefficients is outside of the ± 5% confidence interval for its estimated value). 
Typically, such a probability will not be constant, but rather will depend upon the 

20 particular scenario (i.e., the input data values for the set of exogenous variables). 
For instance, the p value generally be significantly higher within a region of the 
exogenous variable space in which relatively little of the historical data used in 
creating the underlying model is located than in other regions where more such 
historical data points were located. Similarly, for a specified p value, the width of the 

25 confidence intervals typically will depend upon the particular scenario, with wider 
intervals tending to occur in regions of the exogenous variable space in which there 
was relatively little historical data input in step 122. 

It is noted that the f-test can be applied to the price formula to determine the 
statistical significance of the value estimate or to the partial derivatives of the price 

30 formula to determine the statistical significance of the sensitivity, elasticity or similar 
measure. In the event that the f-test is applied to all coefficients in a formula, one 
can obtain a p value that corresponds to a specified confidence interval for the 
endogenous variable or, alternatively, a confidence interval for a specified p value. 
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Thus, it is possible to calculate a p value for value of an asset within a specified 
confidence interval, a p value for a measure of the tendency of an asset value to 
change within a specified confidence interval, or corresponding confidence intervals 
given specified p values. 

5 In addition to evaluation and screening based on a single scenario, the 

present invention also contemplates evaluating and screening based on multiple 
different scenarios. For instance, the user may input a range of data values for one 
or more of the exogenous variables. In this case, the evaluation/screening tool of the 
present invention preferably samples the data values within each such range and 

10 combines the sampled data values to provide multiple different scenarios. After 
calculating asset values and tendencies of asset values to change for each such 
scenario, the evaluation/screening tool may output a range of asset values and a 
range of elasticities (or similar measures) for each asset. Such information may then 
be used as the basis for screening criteria. 

15 As will be observed from the above discussion, the asset evaluation/screening 

tool of the present invention can provide a user with a variety of information that can 
be directly used to maximize the value of the user's portfolio, while limiting the user's 
exposure to particular risk. For example, the user can alter the mix of his or her 
portfolio, input a projected scenario, view how the portfolio value and exposure to 

20 specific risks changes based on that projected scenario, search for optimal assets 
or combinations of assets under specified criteria, and then repeat this process for 
different portfolio compositions, different scenarios and/or different financial and 
economic criteria or goals. 

25 Significance-Based Display . 

Once data have been generated by the evaluation/screening tool of the 
present invention, it often will be desirable to display some or all of such data to the 
end user. For example, it may be desirable to display data concerning the elasticities 
of various assets to the rate of inflation during a given scenario. Conventionally, 

30 information may be displayed in several different ways. For instance, it is possible 
to display such information in a tabular format or in a graphical format. In the 
evaluation/screening tool of the present invention, due to the large amount of 
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information that must be presented simultaneously, it is preferable to display at least 
some of the information graphically. 

For instance, the elasticity data in the above example preferably is displayed 
in a bar graph format, with each different asset corresponding to a point on the x axis 
5 and the elasticity for each asset being represented by a bar whose height 
corresponds to the magnitude of the elasticity, and with the bar originating at y=0 and 
extending upward for positive elasticity and downward for negative elasticity. In 
addition to indicating the magnitude and direction of the elasticity for each asset, the 
display according to the preferred embodiment of the present invention also indicates 

10 the statistical significance of the elasticity for each asset. More preferably, the 
intensity at which the bar for each asset is displayed preferably is a function of the 
statistical significance of the calculated elasticity for that asset. Such a display is 
illustrated in Figure 3. 

Specifically, Figure 3 illustrates a bar graph according to the present invention. 

15 In Figure 3, each bar corresponds to a different asset (e.g., stock) and the height (or 
length) of the bar is proportional to the asset's elasticity to a specified exogenous 
variable (e,g., the Federal Funds Rate). It should be noted that the height of each 
bar may instead be any other function of the elasticity for the corresponding asset, 
although preferably that function is the same for all assets that are displayed at the 

20 same time. Also, although elasticities for various assets are displayed in Figure 3, 
any other measure of a tendency of an asset' s value to change based on a change 
in an exogenous variable may instead be displayed. For simplicity, the following 
discussion will continue to refer to elasticities, it being understood any other such 
measure of tendency to change may be substituted therefor. 

25 As discussed above, the statistical significance for each elasticity calculation 

can be determined, such as by applying the f-test to the coefficients of the elasticity 
formula, which in turn may be derived from the price formula. The resulting p value 
provides a measure of the statistical significance of the calculated elasticity. As also 
noted above, the p value may be tied to a confidence interval for the determined 

30 elasticity or to a set of confidence intervals for the coefficients used in the elasticity 
formula. Similarly, the p value may be dependent upon the point in the exogenous 
variable space (i.e., the particular input scenario). In the preferred embodiment of 
the invention, the p values are calculated with respect to similar confidence intervals 
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across all assets to be displayed and at the same input scenario, such that 
comparisons between the p values will be meaningful. 

The bars 1 80 to 1 83 in Figure 3 reflect the different statistical significances of 
the different assets by being displayed in different intensities, the intensity of each 
5 such bar being a function of the statistical significance of the corresponding asset. 
For instance, the intensity at which a bar is displayed might be equal to 1 minus the 
p value for the corresponding asset, where intensity ranges from 0 (meaning that the 
bar is not displayed at all) to 1 (meaning maximum intensity). In Figure 3, intensity 
is illustrated by the density of the horizontal lines within a bar. Thus, bar 180 is 

10 displayed at a high intensity, indicating that the calculated elasticity of the 
corresponding asset is highly statistically significant (e.g., a p value of 0.05). By 
contrast, bar 1 81 is displayed very lightly, indicating a very low statistical significance 
(e.g., a p value of 0.90). Between these extremes are bars 182 and 183, which 
indicate intermediate levels of statistical significance. 

15 Using such a linear relationship between display intensity and statistical 

significance may be desirable in certain embodiments. However, in other 
embodiments it may be more desirable to highlight certain differences more than 
others. For instance, if one is only interested in very significant data, it may be more 
desirable to non-linearly map the p value (or other measure of statistical significance) 

20 to intensity such that more intensity levels are used in the high end of statistical 
significance (e.g., around p values near 0) than at the low end of statistical 
significance (e.g., around p values near 1). 

Also, although a bar graph is utilized in the foregoing example, it should be 
understood that the technique of varying display intensity levels based on the 

25 statistical significance of the data being displayed can be beneficially used in other 
graphical display methods as well. Such other graphical display methods include 
simply plotting individual data points on a graph, graphical techniques in which a line 
or curve is interpolated between each adjacent pair of data points so as to indicate 
a continuously changing endogenous variable, and any other graphical display 

30 method . Similarly, such techniques may be applied in any other situation where data 
to be displayed have been estimated and have an associated statistical significance. 
The actual measure of statistical significance preferably depends upon the type of 
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data to be displayed, and may include, for example, standard deviation, variance, 
correlation coefficient, and/or any function of the foregoing, in addition to p value. 

The variation in display intensity required for the present invention can be 
accomplished using known techniques. When using a display device having variable 
5 intensity, such as a cathode ray tube (CRT) display, the required intensity is simply 
specified for each display point. When using a monochrome display device, such 
as certain liquid crystal displays or many printing techniques, the appearance of 
varying intensity can be provided by using halftoning, error diffusion or other known 
techniques. 

10 As indicated above, in the preferred embodiments of the invention, statistical 

significance is displayed by changing the intensity of the displayed data points as a 
function of their statistical significance. However, it is also possible to graphically 
indicate statistical significance in other ways as well, including other ways in which 
statistical significance is indicated without requiring a separate coordinate for it on 

15 the graph. For instance one could vary the size of a displayed data point, the width 
of each bar in a bar graph, or the width of line and/or curve segments in a chart 
graph as a function of the statistical significance of the corresponding data points. 
Alternatively, one could vary the hue, saturation, brightness or any other display 
characteristic of the displayed points as a function of statistical significance. For 

20 example, colors at the red end of the color spectrum might indicate low statistical 
significance while colors at the violet end of the color spectrum would indicate high 
statistical significance, or vice versa. As used herein, "display characteristic" is 
intended to mean the way in which a data point is displayed, rather than the position 
at which it is displayed. With any of such alternate display techniques, as well as the 

25 preferred intensity-based technique, the display characteristic (e.g., size, width or 
color property) may be related to the statistical significance by any linear or non- 
linear function. 

Identifying Industry Sectors Using Clusterization . 
30 Once asset sensitivities, elasticities or other measures of tendency to change 

with respect to a number of different exogenous variables have been calculated, 
such as pursuant to the techniques described above, such measures can then be 
used to identify true industry sectors using conventional clusterization techniques. 
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For example, assume that there exists a collection of stocks, such as several 
hundred or several thousand different stocks, that are to be assigned to industry 
sectors. Assume further that elasticities have been calculated for each such stock 
with respect to each of a number of different exogenous variables (e.g., between 30 
5 and 50 such variables). In order to initially simplify the discussion, it is also assumed 
that the elasticities of each asset value to each exogenous variable is constant, such 
as may have been obtained by performing a multi-variate linear regression. In such 
a case, utilizing cluster analysis, a standard statistical grouping method, in an 
innovative manner, the present invention is able to identify relevant sectors and 

10 simultaneously assign the various stocks into those sectors. Accordingly, the 
problems with conventional sectoral analysis, sectoral definition and asset 
classification, are solved simultaneously. 

Cluster analysis algorithms (such as are available in Systat and numerous 
other multi-variate statistics computer programs) attempt to group the data into 

15 clusters such that the measured distance between individual data points within each 
cluster is a minimum, but also such that the measured distance between any two 
clusters is maximized. In other words, cluster analysis attempts to group data points 
so that the groups are as much alike as they can reasonably be, but also so the 
groups are as reasonably different from other groups as they can be. There are 

20 numerous standard methods for clustering data which could be employed, including: 
discrimination functions, factor analysis, and grouping techniques such as iterated 
Chi-Square and maximum-distance measures. 

A preferred embodiment of the invention uses the KMEANS statistical 
procedure, included in statistical packages such as SYSTAT and the S+ statistical 

25 modeling language. The KMEANS algorithm splits N assets into groups by 
maximizing the between-group distance and minimizing the within-group distance. 
It is noted that there are numerous possible distance measures which could be used, 
such as Pearson Product Moment Correlation, Sum of Squared Deviations, and 
Rsquared (1 - Squared Pearson Product Moment Correlation), or the Minkowski 

30 distance, the z-th root of the mean z-th powered coordinate distance, e.g., with an 
initial parameter z = 2. 

The cluster analysis of the present invention may be performed overthe entire 
set of exogenous variables or over any subset thereof. By defining each resulting 
cluster to be a sector, the present invention automatically provides sector definitions. 
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Moreover, because the sectors are formed by clustering assets having similar 
elasticities (or other measures of tendency to change based on changes in the 
exogenous variables), it is more likely than in conventional techniques that the stocks 
in each sector do in fact respond similarly to market conditions or, more accurately, 
5 to the set of conditions represented by the exogenous variables used. 

After having obtained sectors and sector assignments in this manner, 
aggregate statistics for each sector can be calculated and monitored over time in 
order to assess changes in various industries and to utilize such changes to predict 
changes in other industries, as well as in various macroeconomic data. Such 

10 aggregate statistics might include, for example, total gross revenues, total profits, 
total employment, average profit margin, total market capitalization, total inventory 
as well as changes in the foregoing data. Based on the predictions derived from 
such data, assets may be purchased or sold. For instance, declining profitability and 
increasing inventory in a sector that includes a significant number of computer 

15 hardware manufacturers might signal a future decrease in demand for computer 
chips, prompting one to sell stock in computer chip manufacturers. 

Preferably, the elasticities for the stocks in the current example will have been 
determined by using data over some fixed interval of time. By recalculating such 
elasticities on a rolling basis, one can observe how assets move both relative to their 

20 clusters and among clusters over time. Any such changes might signal, for example, 
a change in the direction or management of the underlying company, a change in a 
company's methods or technology that is making that company's business more or 
less dependent on a particular input to production (e.g., a particular type of laborer 
raw materials), or even a diversification by the company into other types of business 

25 that are affected by different conditions. In addition, one may observe how the 
sector definitions themselves change overtime, indicating potential changes in an 
entire industry. 

In a somewhat more complicated example, assume that the elasticities (or 
other measures of tendency of asset value to change as of result of changes in the 
30 exogenous variables) are expressed as a function of the exogenous variables. This 
generally will be the case, for example, where the price function or return function 
has been determined using a non-linear regression or a neural network technique. 
In this case, the assets can be clustered using the foregoing technique and inputting 
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current data values for the exogenous variables (i.e., using the current scenario). 
Alternatively, the assets might be clustered over multiple scenarios, such as by 
sampling the elasticities using such multiple scenarios and then clustering on the 
basis of all resulting data. Such multiple scenarios might be selected, for example, 

5 to include the current scenario and group of scenarios in the neighborhood of the 
current scenario. Still further, separate clustering might be performed for each such 
separate scenario and then the resulting sectors compared across different 
scenarios. Also, as with the example described above, the sectors may be 
recomputed on a rolling basis and changes in both the assets and the sector 

10 definitions observed over time. 

The various techniques described above may be used in any or all possible 
combinations, depending upon the data needs of the end user. Common to all such 
embodiments, however, is the grouping of assets based on similarities of their 
tendencies to change in value as the result of changes in a set of exogenous 

15 variables. The most common application of this aspect of the invention is for use in 
defining business sectors and for classifying stocks into those sectors. However, the 
techniques described above may be used on connection with any other types of 
assets as well. By grouping assets in this manner, the present invention provides the 
basis for predicting future changes in both asset values and macroeconomic 

20 variables. Such data and predictions can be directly incorporated into existing and 
future models for selecting stocks and other assets to purchase and sell, thereby 
having direct application to asset portfolio management and financial planning. In 
fact, many existing models incorporate sectoral statistics for just such purposes. The 
results of this aspect of the present invention can be used beneficially in such 

25 models. Moreover, because sectoral analysis of the present invention overcomes 
many of the problems of conventional sectoral analysis techniques, substitution 
using the results of the present technique often will provide more accurate 
information, thus permitting those models to provide more effective buy/sell 
strategies. 

30 
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Design System Environment . 

Generally, the methods described herein will be practiced with a general- 
purpose computer, either with a single processor or multiple processors. Figure 4 
is a block diagram of a general-purpose computer system, representing one of many 
5 suitable computer platforms for implementing the methods described above. Figure 
4 shows a general-purpose computer system 250 in accordance with the present 
invention. As shown in Figure 4, computer system 250 includes a central processing 
unit (CPU) 252, read-only memory (ROM) 254, random access memory (RAM) 256, 
expansion RAM 258, input/output (I/O) circuitry 260, display assembly 262, input 
10 device 264, and expansion bus 266. Computer system 250 may also optionally 
include a mass storage unit 268 such as a disk drive unit or nonvolatile memory such 
as flash memory and a real-time clock 270. 

CPU 252 is coupled to ROM 254 by a data bus 272, control bus 274, and 
address bus 276. ROM 254 contains the basic operating system for the computer 
15 system 250. CPU 252 is also connected to RAM 256 by busses 272, 274, and 276. 
Expansion RAM 258 is optionally coupled to RAM 256 for use by CPU 252. CPU 
252 is also coupled to the I/O circuitry 260 by data bus 272, control bus 274, and 
address bus 276 to permit data transfers with peripheral devices. 

I/O circuitry 260 typically includes a number of latches, registers and direct 
20 memory access (DMA) controllers. The purpose of I/O circuitry 260 is to provide an 
interface between CPU 252 and such peripheral devices as display assembly 262, 
input device 264, and mass storage 268. 

Display assembly 262 of computer system 250 is an output device coupled 
to I/O circuitry 260 by a data bus 278. Display assembly 262 receives data from I/O 
25 circuitry 260 via bus 278 and displays that data on a suitable screen. 

The screen for display assembly 262 can be a device that uses a cathode-ray 
tube (CRT), liquid crystal display (LCD), or the like, of the types commercially 
available from a variety of manufacturers. Input device 264 can be a keyboard, a 
mouse, a stylus working in cooperation with a position-sensing display, or the like. 
30 The aforementioned input devices are available from a variety of vendors and are 
well known in the art. 

Some type of mass storage 268 is generally considered desirable. However, 
mass storage 268 can be eliminated by providing a sufficient mount of RAM 256 and 
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expansion RAM 258 to store user application programs and data. In that case, 
RAMs 256 and 258 can optionally be provided with a backup battery to prevent the 
loss of data even when computer system 250 is turned off. However, it is generally 
desirable to have some type of long term mass storage 268 such as a commercially 
5 available hard disk drive, nonvolatile memory such as flash memory, battery backed 
RAM, PC-data cards, or the like. 

A removable storage read/write device 269 may be coupled to I/O circuitry 
260 to read from and to write to a removable storage media 271. Removable 
storage media 271 may represent, for example, a magnetic disk, a magnetic tape, 

10 an opto-magnetic disk, an optical disk, or the like. Instructions for implementing the 
inventive method may be provided, in one embodiment, to a network via such a 
removable storage media. 

In operation, information is input into the computer system 250 by typing on 
a keyboard, manipulating a mouse or trackball, or "writing" on a tablet or on 

15 position-sensing screen of display assembly 262. CPU 252 then processes the data 
under control of an operating system and an application program, such as a program 
to perform some or all of the steps of the inventive methods described above, stored 
in ROM 254 and/or RAM 256. It is noted that such process steps may initially be 
stored in mass storage 268, downloaded into RAM 256 and then executed out of 

20 RAM 256. CPU 252 then typically produces data which is output to the display 
assembly 262 to produce appropriate images on its screen. 

Expansion bus 266 is coupled to data bus 272, control bus 274, and address 
bus 276. Expansion bus 266 provides extra ports to couple devices such as network 
interface circuits, modems, display switches, microphones, speakers, etc. to CPU 

25 252. Network communication is accomplished through the network interface circuit 
and an appropriate network. 

Suitable computers for use in implementing the present invention may be 
obtained from various vendors. Various computers, however, may be used 
depending upon the size and complexity of the tasks. Suitable computers include 

30 mainframe computers, multiprocessor computers, workstations or personal 
computers. In addition, although a general purpose computer system has been 
described above, a special-purpose computer may also be used. 
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It should be understood that the present invention also relates to machine 
readable media on which are stored program instructions for performing some or all 
of the methods of this invention. Such media include, by way of example, magnetic 
disks, magnetic tape, optically readable media such as CD ROMs, semiconductor 
5 memory such as PCMCIA cards, etc. In each case, the medium may take the form 
of a portable item such as a small disk, diskette, cassette, etc., or it may take the 
form of a relatively larger or immobile item such as a hard disk drive or RAM 
provided in a computer. 

10 Conclusion . 

Although the present invention has been described in detail with regard to the 
exemplary embodiments and drawings thereof, it should be apparent to those skilled 
in the art that various adaptations and modifications of the present invention may be 
accomplished without departing from the spirit and the scope of the invention. 

15 Accordingly, the invention is not limited to the precise embodiments shown in the 
drawings and described in detail above. Rather, it is intended that all such variations 
not departing from the spirit of the invention be considered as within the scope 
thereof as limited solely by the claims appended hereto. 

Also, several different embodiments of the present invention are described 

20 above, with each such embodiment described as including certain features. 
However, it is intended that the features described in connection with the discussion 
of any single embodiment are not limited to that embodiment but may be included 
and/or arranged in various combinations in any of the other embodiments as well, 
as will be understood those skilled in the art. 
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CLAIMS 

What is claimed is: 

1 . A method for classifying stocks into business sectors, said method 
comprising: 

(a) calculating, for each of plural exogenous variables, a measure of a 
tendency for a value of a stock to change as a result of a change in a data value for 

5 said each exogenous variable; 

(b) repeating step (a) for each of plural different stocks; and 

(c) grouping said plural different stocks into plural different sectors based on 
similarities of said measures of tendency to change. 

2. A method according to Claim 1 , wherein said measure of tendency to 
change comprises a measure of elasticity. 

3. A method according to Claim 1 , wherein step (a) comprises: 

(a1 ) processing historical data for value of the stock and historical data values 
for said plural exogenous variables to obtain a price formula for estimating the value 
of the stock as a function of the exogenous variables; and 
5 (a2) taking a derivative of the price formula to obtain a formula expressing 

said tendency to change. 

4. A method according to Claim 3, wherein step (a1) comprises 
performing a statistical regression technique. 

5. A method according to Claim 3, wherein said price formula is 
expressed as a truncated Taylor series expansion. 

6. A method according to Claim 1 , wherein step (c) comprises performing 
a statistical clustering technique whereby said plural different sectors are defined by 
clusters resulting from said statistical clustering technique. 



0191802.1 



33 



35512-00034 



7. A method according to Claim 1 , wherein step (c) comprises performing 
a statistical regression technique. 

8. A method according to Claim 1 , further comprising a step of calculating 
a representative characteristic of stocks in a specific sector used in step (c). 

9. A method according to Claim 8, further comprising a step of comparing 
a characteristic of a specific stock in said specific sector to the representative 
characteristic of stocks in said specific sector. 

1 0. A method according to Claim 9, further comprising a step of purchasing 
an asset based on a result of said step of comparing. 

11. A method according to Claim 8, wherein said representative 
characteristic comprises an average return to stocks in said specific sector. 

12. A method according to Claim 11, wherein said average return is 
calculated using a weighted average. 

13. A method according to Claim 1 , further comprising a step of periodically 
repeating steps (a) through (c). 

14. A method according to Claim 13, further comprising a step of tracking 
a position of a particular stock overtime relative to its assigned sector. 

15. A method according to Claim 13, further comprising a step of tracking 
reclassification of a particular stock from a first sector to a second sector. 

16. A method according to Claim 1, wherein step (a) comprises 
determining a formula for calculating said measure of tendency to change, said 
formula being a function of said exogenous variables. 
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17. A method according to Claim 16, further comprising steps of: 
calculating plural samples of said measure of tendency to change using said 

formula for each of said plural different stocks; and 

using said samples in step (c) for grouping said plural different stocks into said 
5 plural different sectors. 

18. A method according to Claim 17, wherein said measure of tendency 
to change is calculated in step (a) for each of the plural different stocks using 
historical data values for said exogenous variables over a same period of time. 

19. A method according to Claim 18, wherein said samples are taken from 
a region of a multi-dimensional space defined by said exogenous variables in which 
the historical data values for said exogenous variables used in step (a) are clustered. 

20. A method according to Claim 1 , step (a) comprises a step of processing 
historical data for value of the stock and historical data values for said plural 
exogenous variables to obtain a price formula for estimating the value of the stock 
as a function of the exogenous variables. 

21 . A method according to Claim 20, wherein said price formula is obtained 
by performing neural network processing. 

22. A method according to Claim 21 , wherein said measure of tendency 
to change is calculated by inputting different data values for the exogenous variables 
and observing how an output of said price formula changes as a result of small 
changes in the data values for the exogenous variables. 

23. A method according to Claim 20, wherein said price formula is obtained 
by using a genetic algorithm. 

24. An apparatus for classifying stocks into business sectors, said 
apparatus comprising: 
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(a) means for calculating, for each of plural exogenous variables, a measure 
of a tendency for a value of a stock to change as a result of a change in a data value 

5 for said each exogenous variable; 

(b) means for repeating the calculating performed by means (a) for each of 
plural different stocks; and 

(c) means for grouping said plural different stocks into plural different sectors 
based on similarities of said measures of tendency to change. 

25. A computer-readable medium storing computer-executable process 
steps for classifying stocks into business sectors, said process steps comprising 
steps to: 

(a) calculate, for each of plural exogenous variables, a measure of a 
5 tendency for a value of a stock to change as a result of a change in a data value for 

said each exogenous variable; 

(b) repeat step (a) for each of plural different stocks; and 

(c) group said plural different stocks into plural different sectors based on 
similarities of said measures of tendency to change. 
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ABSTRACT OF THE DISCLOSURE 

Provided is a technique for classifying stocks into business sectors by 
calculating, for each of multiple exogenous variables, a measure of a tendency for 
5 a value of a stock to change as a result of a change in a data value for each such 
exogenous variable. The foregoing step is then repeated for each of several 
different stocks. Finally, the different stocks are grouped into different sectors based 
on similarities of such measures of tendency to change. 
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accordance with Title 37, Code of Federal Regulations, § 1 .56(a). 
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application: 
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as my attorneys with full power of substitution and revocation, to prosecute this application and to transact in 
connection therewith all business in the United States Patent and Trademark Office and before competent 
International Authorities. 

Please send all correspondence to: 

Steven E. Shapiro, Esq. 
Mitchell, Silberberg & Knupp LLP 
1 1377 West Olympic Boulevard 
Los Angeles, California 90064 
(310)312-2000 



Wherefore I pray that Letters Patent be granted to me for the invention or discovery described and claimed 
in the foregoing specification and claims, and i hereby subscribe my name to the foregoing specification and claims, 
declaration, power of attorney, and this petition. 

Listing of Inventors Continued on Page 3 hereof [ ] Yes [X] No 



Full name of first inventor ^ Stephen A. Klein 



Inventor's signature ..^^f^K^ U 
Residence Pasadena, California / 



Date 



Citizenship United States 



Post Office Address 448 S. Santa Anita Avenue. Pasadena. California 91107 



Full name of secondtoffifltor G. Michael Phillips 

Inventor's signature s ,Jf/^ Date 7~ ^ 

Residence Pasipii#, California 

Citizenship " United States 

Post Office Address 3580 Cartwright Street, Pasadena, California 91 107 



Full name of third inventor _ 

Inventor's signature 
Residence Simi Valley, Californi; 



William P. Jennings 



Citizenship United States 




Date 



Post Office Address 3072 Kilaine Drive, Simi Valley, California 93063 



Full name of fourth iriv 




Inventor's signature 
Residence Los 
Citizenship United States 



Post Office Address 3606 Amesburv Road, Los Angeles, California 90027 



Full name of fifth inventor Mark E. Rice 



Inventor's signature 

Residence Pasadena, California 



intor MarK b. Kice 

AJU^U Z (luc^ Date jj^/od 



Citizenship United States 



Post Office Address 763 E. California Boulevard, Pasadena, California 91106 



0250299 1 



